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Abstract 

Identifying highly susceptible individuals in spreading processes is of great 
signihcance in controlling outbreaks. In this paper, we explore the suscepti¬ 
bility of people in susceptible-infectious-recovered (SIR) and rumor spreading 
dynamics. We hrst study the impact of community structure on people’s sus¬ 
ceptibility. Despite that the community structure can reduce the infected 
population given same infection rates, it will not deterministically affect 
nodes’ susceptibility. We hnd the susceptibility of individuals is sensitive 
to the choice of spreading dynamics. For SIR spreading, since the suscepti¬ 
bility is highly correlated to nodes’ influence, the topological indicator /c-shell 
can better identify highly susceptible individuals, outperforming degree, be¬ 
tweenness centrality and PageRank. In contrast, in rumor spreading model, 
where nodes’ susceptibility and influence have no clear correlation, degree 
performs the best among considered topological measures. Our hnding high¬ 
lights the signihcance of both topological features and spreading mechanisms 
in identifying highly susceptible population. 
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1. Introduction 

Developing efficient methods to prevent epidemic outbreaks or acceler¬ 
ate information dissemination is the ultimate goal of research on spread¬ 
ing dynamics across various domains [1-15]. For diseases and information 
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that spread through social networks, the structure of underlying contact net¬ 
work can greatly affect the spreading processes [13-15]. Previous studies 
have inspected the role of topology in epidemic threshold and critical be¬ 
havior. For example, in uncorrelated networks, the epidemic threshold is 
A = (fc)/((/c^) — (/c)), where (k) and are the first and second moments 
of the degree distribution [14]. Later on, in view of the wide occurrence of 
intrinsic heterogeneous complex networks in real world, researchers started 
to investigate the effect of microstructure on spreading dynamics in more 
detailed perspectives: the community structure appearing in social networks 
[16], fc-shell decomposition of networks for identification of influential spread¬ 
ers [3], link salience skeleton correlated with the frequency of a link’s appear¬ 
ance in infection hierarchies [17], as well as weak ties which is significant for 
information dissemination [18]. 

Due to the existence of complex microstructure, individuals with different 
topological features should play distinct roles during the spreading processes. 
Previous studies mainly focus on the spreading ability of individnals. Many 
topological measnres are employed to identify inflnential spreaders in net¬ 
works, snch as degree [4, 13], betweenness [19], fc-shell index [3, 20], PageR- 
ank [21, 22], etc. In fact, individuals situated in different positions of the 
contact network will also vary in susceptibility to epidemics. Identifying the 
highly snsceptible individuals in contact networks is of great signihcance in 
controlling epidemic outbreaks. Therefore, it is indispensable to explore the 
factors that affect people’s susceptibility. 

In this paper, we study the susceptibility of individuals in SIR and rumor 
spreading dynamics through extensive simulations in real-world social net¬ 
works. First, we explore the impact of community structure on people’s sus¬ 
ceptibility. Even though the community structure can diminish the infected 
population and slow down the spreading processes, it is not the pivotal factor 
affecting nodes’ susceptibility. Although the location of epidemic source has 
an impact on the precise infected probability of each single person, there exist 
a gronp of nodes that can always get infected with relatively high probabili¬ 
ties no matter where the spreading originates. We are particularly interested 
in these populations and consider them as highly susceptible individuals. To 
quantitatively depict the property of these people, we define the susceptibil¬ 
ity of each individual as the average probability to be infected by a randomly 
chosen spreading source. Then we further explore the topological properties 
of the highly susceptibility individuals. By examining their topological traits 
including degree, fc-shell, betweenness centrality and PageRank, we hnd the 
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Table 1: Statistics of social networks. We display the node number N, link number L, and 
average degree {k) for Facebook and Enron social networks. The critical infection rate of 
SIR model fic is calculated by (fc)/((fc^) — (fc)). fis and [3r are adopted infection rates in 
SIR and rumor spreading simulations respectively. 

,, , , Topological statistics „ n n 

Network - ^/3c (Js Pr 

Facebook 4,039 88,234 43.7 0.0095 0.03 0.3 

Enron 36,692 367,662 20.0 0.0072 0.02 0.3 


susceptibility of individuals is sensitive to the choice of spreading dynamics. 
For SIR spreading, the susceptibility is highly correlated to nodes’ influence. 
Therefore, nodes located in the core region of networks are more likely to 
be infected. However, in rumor spreading model, degree can better iden¬ 
tify highly susceptible individuals. Our results indicate that it is necessary 
to acquire the information of spreading mechanism to better locate highly 
susceptible population in practice. 

2. Impact of community structure on spreading processes 

Most of social networks in reality have notable community structure. We 
hrst explore the impact of community structure on individuals’ probabilities 
to be infected in spreading processes. We examine the epidemic flows in 
two social networks with community structure: (1) the friendship network 
between users of the social network Facebook (Facebook) [23], (2) the Enron 
email communication network (Enron), in which nodes are email addresses 
and edges represent email communications [24]. These social networks have 
been used in previous studies. Topological statistics of networks are shown 
in Table 1. In our following study, we treat all these networks as undirected. 

The community structure is extracted by a fast heuristic method based on 
modularity optimization [25]. The layout of community structure for Face- 
book social network is shown with different colors in Fig. 1(a). To simulate 
spreading processes among populations, we hrst apply susceptible-infectious- 
recovered (SIR) [26, 27] model on the above social networks. In SIR model, 
the population can be classihed into three possible states: susceptible (S), 
infected (I), or recovered (R). Starting from a single epidemic source node. 
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Fig. 1: SIR epidemic spreading in Facebook social network. The infection rate is set as 
Ps = 0.03. (a), The layout of community structure for Facebook social network. Eight 
communities are distinguished by different colors, (b, c, d), Distributions of infected 
probability for spreading sources in different communities. The solid black dots represent 
outbreak origins. The infected probability is obtained by averaging over 10® SIR real¬ 
izations. The colors indicate the logarithmic values (base 10) of infected probability for 
corresponding individuals. 
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at each time step, the infected individuals (I) would infect their susceptible 
neighbors (S) with probability fis and then become recovered (R) and gain 
permanent immunity with probability A. Such process continues until there 
are no infected individuals in the system. In our simulations, without loss 
of generality, we set A = 1. Previous studies have adopted this model to de¬ 
scribe diffusion of contagious diseases. For information dissemination, recent 
works have shown that, although some information spreading processes are 
greatly affected by human-related factors [28-32], a few information memes 
showing viral behavior can also spread like infectious diseases [33]. Hence the 
SIR model is applicable not only to epidemics but also to those information 
memes which are not seriously affected by human-related factors. 

In simulations, the selection of infection rate fis could signihcantly affect 
the spreading outcomes. Below the critical value /^c = (/c)/((P) — (/c)), the 
proportion of infected individuals will vanish in the limit of a very large 
population. For f^s much larger than Pc, the epidemic will infect almost all 
the people in the network. In our study, we use relatively small values of 
Ps to guarantee medium scale infection coverage. Otherwise most of the 
nodes will be infected with high probabilities, making their susceptibility 
indistinguishable. In Table 1, we report the critical infection rate Pc and 
adopted value Ps in our simulations. The infection rate Ps is slightly above 
its corresponding critical value Pc to generate moderate coverage. 

For Facebook network, we select three nodes in different communities as 
spreading sources and apply 10® realizations of SIR model. The infected 
probability of each individual is approximated by the frequency of getting 
infected in these realizations. In Fig.l(b-d), we display the distribution of 
infected probability for Facebook social network. The black dots represent 
outbreak origins and the infected probability is indicated by colors. Clearly, 
the epidemic permeates to several communities. Moreover, during all the 
epidemic spreading, there exist a group of individuals retaining relatively 
high infected probabilities, regardless of the positions of spreading sources. 

In order to check the effect of community structure on spreading pro¬ 
cesses, we compare the evolution of infected proportion p in networks with 
and without community structure. For Facebook network, we rewire the 
links to destroy the community structure. Specihcally, we randomly select 
two links {A,B) and {C,D), and then swap the end nodes to {A,D) and 
{B,C). This procedure breaks down the communities but preserves the de¬ 
gree distribution. In our simulations, we perform 10® rewire operations. The 
result of SIR model is shown in Fig. 2(a). Under same conditions, the rewired 
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Fig. 2: Community structure will reduce the infected population and slow down the spread¬ 
ing processes. For Facebook and rewired Facebook social networks, we perform SIR model 
with Ps = 0.03. The evolution of infected proportion p is displayed in (a). In (b), we 
present the infected proportion p versus time t for rumor spreading model with Pn = 0.3. 


Facebook network has a much higher infected proportion, which implies the 
community structure restricts the spreading processes to some extent. Mean¬ 
while, it will take a longer time for Facebook network with community struc¬ 
ture to achieve a given infected proportion p. Therefore, the community 
structure will slow down the speed of SIR spreading. 

To examine whether our finding is applicable to other spreading dynam¬ 
ics, we perform the same analysis on rumor spreading model [8]. In rumor 
spreading model, each individual can be in three possible states: the spreader 
(S), ignorant (I), and stifler (R). Spreaders represent nodes that are aware of 
the rumor and are willing to transmit it. Ignorant people are individuals un¬ 
aware of the rumor. Stiflers stand for those that already know the rumor but 
are not willing to spread it anymore. In each time step, the spreaders contact 
one of their neighbors randomly and turn the ignorant ones into spreaders 
with probability (Pr- If the spreaders encounter spreaders or stiflers, they will 
turn to stiflers with probability A. We also set A = 1 in our simulations. We 
obtain similar results in Fig. 2(b). 

3. Susceptibility of individuals in spreading processes 

During spreading processes, the exact values of infected probability de¬ 
pends on the choice of spreading sources. But in real-world applications, 
usually we have no information about the locations of spreading origins, 
which makes it impractical and intractable to obtain individuals’ exact in¬ 
fected probability during an epidemic outbreak. In spite of that, there exist 
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a group of nodes exhibiting relatively high infected probability regardless of 
the position of source (see Fig.l(b-d)). In our work, we will focus on these 
persons who would be infected easily from a randomly chosen spreading ori¬ 
gin. We dehne the source-independent susceptibility Si of a given individual i 
as the average infected probability over epidemics starting from all the nodes 
in the network: 

i=i i=i 

where N is the number of nodes in the network, and T is the number of 
spreading realizations originating from each node. In our research, we set 
T = 10^. Besides, denotes whether node i gets infected {irfj = 1) or 
not (tt^ = 0) during the spreading realization originating from node j. 
Accordingly, represents the average infected probability of 

node i during the spreading from node j. We should note that this dehnition 
of susceptibility is different from that in [34], where the susceptibility is de- 
hned as the nature of individuals, such as immune system, physical condition 
or learning ability, etc. According to our dehnition, on average, nodes with 
higher susceptibility Si are more likely to be infected during an epidemic 
wherever the contagion starts. 

For SIR spreading model, we can calculate the susceptibility with the 
standard set of equations of SIR dynamics. For a network with N nodes, the 
topology is recorded by the adjacency matrix A = {aij}NxN, where ajj = 1 if 
node i and j are connected, and aij = 0 otherwise. We denote the probability 
of node i being in susceptible, infected and recovered state at time t as pf(t), 
Piit) and pf(f) respectively. For networks without too many short loops, we 
assume the neighbors of one node can independently get infected at time t. 
Then the evolution of dynamics follows 

N 

-[1 - “ PsaijPj{t)pf{t))], ( 2 ) 

i=i 

N 

[1 - “ PsaijPj{t)pf{t))]-pl{t), (3) 

t=i 

piit)- (4) 

)f adjacency matrix A. In case of small infection 


dpfjt) 

dt 

dpi jt) 
dt 

dpfjt) 

dt 

Here aij is the element 
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rate f3s-, the term [1 — n^i(l — f3saijp^j{t)p^{t))] can be approximated by 
PsPi {t)- In fact, there are only two independent equations 

since pfit) +pl (t) +pf (t) = 1. We discretize the above equations and obtain 
two independent ones 

N 

pi (^ +1) = /^s(i - pi(t) - pf{t)) (5) 

i=i 

pf(^ + l) = pfif)+pi{t)- (6) 

For simplicity, the above equations can be transformed into a matrix 

form. Denote p^{t) as the vector (p^it), ■ ■ ■ ,pj^{t))'^, p^{t) as the vec¬ 
tor (p^(t), • • • ,p^(f))^, and P^(t) as the diagonal matrix diag{l — plit) — 
Poit), • • •, 1 — pj^{t) — p^{t)). Then we have 

p’(t+l) = PsP‘’(t)Ap’{t), (7) 

p^f + l) = p^{t)+p‘{t). (8) 

Therefore, we obtain the evolution functions for p^{t) and p^{t). Since 
we select the spreading source randomly, the initial condition is p^{0) = 
{1/N, • • •, 1/N) and p^(0) = ((iV - 1)/N, - 1)/N). The suscepti¬ 
bility vector S = {Sq, ■ ■ ■, = p^{oo). Here oo indicates the spreading 

dynamic evolves for a long enough period of time so that no infected individ¬ 
uals are left in the system. In case of sparse networks, the iteration converges 
very quickly. Therefore the susceptibility can be calculated by this method 
efficiently. 

We verify the calculation on Erdos-Renyi (ER) random networks. We 
construct the ER random network with size iV = 10^ and average degree 
(k) = 5 by randomly assigning 25, 000 links among 10^ nodes. Then we 
perform SIR simulations to obtain simulated susceptibility Si and calculate 
the theoretical value through iterations. For (3s = 0.1, the theoretical 
values agree well with the simulation results (see Fig. 3(a)). However, for 
(3s = 0.25, which is above the threshold value (3c = 0.2, theoretical values 
are considerably larger than simulation values (see Fig. 3(b)). This discrep¬ 
ancy can be explained by the invalidation of assumptions. For a relatively 
large infection rate (3s, the epidemic would occupy a macroscopic fraction of 
population. Under this condition, the assumption that a node’s neighbors 
are infected independently becomes invalid. Therefore, the exact probability 
of a node i to be infected by its neighbors at time t should be smaller than 



Fig. 3; For ER random network {N = 10'’’, (k) = 5), the simulation values of Si and 
theoretical predictions are displayed in (a) (/3s = 0.1) and (b) (/3s = 0.25). The slope 
of simulation values versus theoretical predictions for different /3s is shown in (c). The 
inset presents their correlation coefficient. In (d), we show the residual error of theoretical 
predictions for various /3s- 
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(^sPi(t)'^f=iCiijPj(t), which implies our method overestimates nodes’ sus¬ 
ceptibility. Even though the calculation has the problem of overestimation, 
the theoretical predictions and simulation values follow a straight line. We 
£t this line to Si = kSf^ for different jig G [0, 0.3] and examine the correla¬ 
tion coefficient between Si and Sf^ in the inset of Fig. 3(c). It can be seen 
that theoretical predictions are highly correlated with simulated susceptibil¬ 
ity. However, the slope k varies as (3s increases. For (3s < 0.15, the slope 
k remains around 1, which means theoretical values are quite accurate. As 
(3s approaching to (3c = 0.2, the slope k decreases dramatically. This implies 
Sf^ becomes larger than S'*, even though they still have a linear relationship. 
This phenomenon is also reflected in Fig.3(d), in which we display the resid¬ 
ual error R = \ Si — S(^\ for different (3s- With (3s increasing to (3c., the 

residual error R rises abruptly. Therefore, our method predicts susceptibility 
accurately for infection rate (3s smaller than threshold value (3c, and can be 
used to rank susceptibility for (3s near or larger than (3c- 

4. Topological indicators for highly susceptible individuals 

Although we have calculated the susceptibility with the standard set of 
equations of spreading dynamics, it is still convenient to discern highly sus¬ 
ceptible individuals with topological indicators in practice. Here we explore 
some of the most important topological measures in network theory. The 
most straightforward and simple one is degree k, i.e. the number of con¬ 
nections a node maintains. Hubs (high degree nodes) are believed to play 
significant roles in many dynamical processes. More connections may lead 
to a higher chance to be infected in an epidemic spreading. However, degree 
only reflects nodes’ number of neighbors. In order to check people’s global 
location in networks, we also examine the fc-shell index kg by fc-shell decom¬ 
position analysis [3, 35-39]. The fc-shell decomposition can be described as 
follows: hrstly we start by recursively removing nodes with degree k = 1 
until there are only nodes with degree k > 1 left in the network. All the 
nodes removed in this procedure are assigned with kg = 1- In a similar 
way, we iteratively hnd out higher /c-shells until all nodes are pruned. In 
this procedure, each node will be assigned a unique /c-shell index kg- Nodes 
belonging to high (low) fc-shells are located in the core (periphery) area of 
the network. Another topological measure, betweenness centrality, quantihes 
the number of shortest paths passing through each node. In social science. 
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the betweenness centrality of node i, denoted by BC{i) is defined as 


BC({) = y ^ 


( 9 ) 


where agt is the number of shortest paths between nodes s and t, and Cstii) 
is the number of shortest paths between s and t which pass through node i. 
Consequently, nodes with large betweenness centrality usually hold the vital 
positions in the shortest pathways between large numbers of nodes pairs. In 
addition, considering the importance of nodes’ neighbors, PageRank, which 
is originally introduced to rank web pages, is proposed [21], The PageRank 
of a node i in a network can be calculated from 



( 10 ) 


where a^- is the element of adjacency matrix A and d is the jumping prob¬ 
ability. In our calculation, as in many previous studies, we set d = 0.15 
conventionally. To sum up, we have chosen four typical topological mea¬ 
sures proposed from different perspectives: degree k, /c-shell ks, betweenness 
centrality BC and PageRank. 

First, we compare the performance of /c-shell and degree for SIR model. 
For individuals with a given combination of /c-shell and degree {ks,k), we 
check the average susceptibility S{ks, k): 



( 11 ) 


Here T{ks,k) is the collection of all nodes with fc-shell and degree {ks,k), 
and N{ks,k) is the number of nodes in the set T{ks,k). The susceptibil¬ 
ity Si of node i is calculated by averaging over 10^ SIR realizations in each 
social network. We display the logarithmic value (base 10) of S{ks,k) in 
Fig. 4(a), (d), (g) and (j) for Facebook, Enron, rewired Facebook and rewired 
Enron social networks respectively. It is observed that most highly suscep¬ 
tible individuals occupy high /c-shells. For nodes with a hxed degree k, the 
susceptibility can be either large or small, depending on their /c-shell values. 
Meanwhile, the individuals located in the same ks shell have similar suscep¬ 
tibility. This implies, /c-shell index could better reflect nodes’ susceptibility 
than degree k. We also perform same analyses for betweenness centrality 
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Fig. 4: Comparison of different topological indicators for SIR model. For Facebook (a, 
b, c), Enron (d, e, f), rewired Facebook (g, h, i) and rewired Enron (j, k, 1) networks, 
we plot the average values of logarithmic susceptibility (base 10) for nodes with combined 
k-shell ks and degree k (a, d, g), betweenness centrality BC (b, e, h) and PageRank (c, f, 
i) respectively. The infection rate /3s is 0.03 aird 0.02 for Facebook and Enron networks. 
Values are indicated by colors, which are shown in color bars. 
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Fig. 5: The recognition rate r(/) for different fraction / in SIR model. For Facebook 
(a), Enron (b), rewired Facebook (c) and rewired Enron (d) social networks, we display 
the recognition rate curves r{f) for k-shell ks, degree k, betweenness centrality BC and 
PageRank. The fraction / ranges from 0.05 to 0.5. The infection rate /3s in SIR simulations 
is 0.03 and 0.02 for Facebook and Enron networks. 


BC (see Fig.4(b),(e),(h),(k)) and PageRank (see Fig.4(c),(f),(i),(l)). Consis¬ 
tently, /c-shell index ks outperforms betweenness centrality and PageRank in 
predicting the average susceptibility of individuals. 

We should also note that our results are not affected by the community 
structure. We compare our results in the original and rewired networks in 
Fig.4. The conclusion is not sensitive to the existence of community struc¬ 
ture. No matter where the node locates in the network (between communities 
or within communities), it is susceptible to SIR spreading processes as long 
as it has a high /c-shell index. 

Although /c-shell can better predict the average susceptibility of individ¬ 
uals, since the susceptibility for nodes with same topological measures has 
fluctuations, it is still desirable to explore their accuracy in locating highly 
susceptible persons. To this end, we use the recognition rate r(/) to quan¬ 
tify the efficiency in identifying individuals of high susceptibility [38]. The 
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recognition rate r(/) is defined as 


'■(/) 


|S/I 


( 12 ) 


where Sf and Mj are the sets of nodes ranking in top / percentage by sus¬ 
ceptibility and topological measures, and | • | is the number of elements in the 
set. In fact, recognition rate r(/) indicates the proportion of highly suscepti¬ 
ble individuals which can be predicted by each measure. The higher r(/) is, 
the more accurate the predictor is. In Fig. 5, we display the recognition rate 
r(/) (/ e [0.05,0.5]) for Facebook, Enron, rewired Facebook and rewired 
Enron networks. In most cases, fc-shell index ks and degree k outperform 
betweenness centrality BC and PageRank. And ks performs better than 
degree k in general, except when the fraction / is relatively small for Face- 
book and rewired Facebook networks. This can be explained by the fact that 
in these networks, nodes with extremely large degree tend to locate in the 
core region. Therefore, if we pick high-degree nodes, they will also have high 
ks- However, the number of such hubs is not too large. As we increase the 
fraction /, ks could still identify more highly susceptible individuals. This 
phenomenon depends on the exact topological structure of networks. For 
instance, for tree-like networks, ks only contains a few values. Under this 
condition, degree k can clearly better distinguish highly susceptible individ¬ 
uals. Whereas, such simple topological structure usually does not appear 
in large scale real-world social networks. On the contrary, realistic social 
networks posses rather complex structure, in which hubs are not necessarily 
located in the dense core. So in real scenario, ks is still an effective predictor 
for highly susceptible individuals. 

As pointed out in the supporting information of Ref. [3], for SIR model, 
nodes’ influence and infected probability are highly correlated. We present 
the relationship between susceptibility S and influence M of SIR model in 
Fig.6(a). Influence Mi is dehned as the average size of the population infected 
in a spreading process originating at node i. Since fc-shell outperforms other 
topological measures in identifying influential spreaders for SIR model [3, 38], 
it should be also effective in locating highly susceptible individuals. This 
explains the efficacy of fc-shell for SIR model. In fact, for any independent- 
interaction spreading models in which susceptibility and influence are closely 
correlated, such as SIR and SIS model, fc-shell should be a good topological 
indicator for highly susceptible people. 
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Fig. 6: The relationship between susceptibility S and influence M of SIR model is presented 
in (a). The infection rate /3s is set to be 0.03. We also display susceptibility S versus 
influence M of rumor spreading model in (b). /3 _r is set to be 0.3. 


However, for rumor spreading model, we find the susceptibility and in¬ 
fluence have no clear correlation, as shown in Fig. 6(b). In rumor spreading 
model, influential spreaders are not necessarily highly susceptible to spread¬ 
ing processes. It has been found that /c-shell cannot represent nodes’ spread¬ 
ing capability in rumor spreading model [9]. Then it is desirable to hnd the 
topological indicator of susceptibility in rumor spreading model. Since the 
hgure like Fig. 4 cannot distinguish the performance of some measures clearly, 
we directly display the recognition rate for each topological indicator in Fig. 7. 
For all considered networks, degree is capable of recognizing more high-risk 
people, which differs from the better performance of fc-shell for SIR model. 
Also, the community structure will not affect the prominent performance of 
degree. Whereas, after rewiring links, the recognition rates for all indicators 
are enhanced. This implies the existence of community structure undermines 
the effectiveness of topological indicators. 

In fact, the different results obtained in SIR model and rumor spreading 
model can be explained by the spreading mechanisms. In SIR model, the 
superior performance of fc-shell relies on the cascading behavior of spreading 
processes. Starting from the source, the epidemic will diffuse from person 
to person contagiously and may eventually spread through the majority of 
population in a “viral” way. Usually, a spreading process will last for many 
generations before it dies out. For nodes located in the core region, not 
only their neighbors are prone to be infected, but also their neighbors of 
neighbors have large chances to be infected, and so forth. Consequently, k- 
shell can better describe nodes’ susceptibility in SIR model. On the contrary. 


15 






Fig. 7: The recognition rate r(/) for different fraction / in rumor spreading model. For 
Facebook (a), Enron (b), rewired Facebook (c) and rewired Enron (d) social networks, we 
display the recognition rate curves r{f) for k-shell ks, degree fc, betweenness centrality 
BC and PageRank. The fraction / ranges from 0.05 to 0.5. The infection rate ISr in 
rumor spreading simulations is 0.3 for all networks. 
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in rumor spreading model, once a spreader encounters a spreader or stifler 
neighbor, it will stop spreading the information. This dramatically restrains 
the cascading behavior. It has been found that nodes with high /c-shell values 
serve as “hrewalls” of rumor spreading [9], i.e., once these nodes are infected, 
they will soon turn into stiflers. Under this situation, high fc-shell nodes will 
have lower chances to be infected due to their “hrewall” neighbors. Since 
the cascading behavior is restricted, it is sufficient to consider only one-step 
neighbors. Therefore, degree outperforms fc-shell in rumor spreading model. 

5. Conclusion and Discussion 

In this paper, we investigate individual’s susceptibility in spreading pro¬ 
cesses. We hrst study the impact of community structure on people’s sus¬ 
ceptibility. We hnd that, although the community structure can reduce the 
infected population and slow down the spreading processes, it will not affect 
nodes’ susceptibility signihcantly. For SIR model, we hnd the topological 
indicator fc-shell can better identify highly susceptible individuals, outper¬ 
forming degree, betweenness centrality and PageRank. This can be explained 
by the high correlation between nodes’ susceptibility and inhuence. On the 
contrary, for rumor spreading model, nodes’ susceptibility and inhuence have 
no clear correlation. Degree performs the best among considered topological 
measures. We give an explanation of the diherent results of SIR and rumor 
spreading models based on spreading mechanisms. In general, for spread¬ 
ing processes that can trigger large-scale cascading behaviors, such as SIR 
model, nodes in the core region should be more susceptible since they tend 
to have highly susceptible neighbors, neighbors of neighbors, and so forth. 
In contrast, in spreading models that rely on short-range dihusion, degree 
should be a good choice of indicator for susceptibility. Our hnding indicates 
that, when identifying highly susceptible individuals, it is necessary to take 
both topological features and spreading mechanisms into account. 
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